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1.  Introduction 

As  in  many  other  fields  of  engineering,  the 
design  of  ocean  or  marine  structures  is  governed  by 
extreme  values  of  wave  heights.  Several  methods 
have  been  given  in  the  past  for  the  determination 
of  design  values.  However,  no  method  is  widely 
accepted  by  the  engineering  community. 

Traditionally,  the  analysis  of  yearly  maxima  has 
been  considered  as  a good  method  for  this  pur- 
pose. However,  recently,  peak  value  methods  arose 
as  a promising  alternative. 

The  aim  of  this  paper  is  to  compare  these  two 
methods  and  illustrate  some  of  the  problems 
related  to  their  use. 

2.  Two  Standard  Methods  in  the 
Determination  of  Wave  Height 
Design  Values 

In  this  section  we  analyze  the  following  two  well 
known  procedures  for  obtaining  design  wave 
heights:  the  peak  value  method  and  the  yearly  max- 
ima method. 


The  first  one  employs  the  peak  wave  heights  of 
individual  storms  and  thus  composes  a set  of  ex- 
treme wave  data.  The  second  one  uses  the  yearly 
maxima. 

Several  authors  have  criticized  the  second 
method  in  that  it  discards  large  wave  heights,  when 
they  occur  in  years  with  large  storms,  but  includes 
relatively  small  wave  heights  which  are  maxima  of 
calm  years. 

2.1  The  Peak  Value  Method 

This  method  consists  of  the  following  steps: 

1.  Fit  the  peak  values  of  individual  storms  to  a 
parametric  family  of  distributions 

Fo(x;  Ao,  So,  Po) , (1) 

where  Ao,  So  and  fio  are  the  parameters.  In  some 
cases  these  three  parameters  can  be  reduced  to' 
two  or  even  to  a single  one.  The  fitting  of  the 
above  family  can  be  done  either  by  using  all  data 
or  only  tail  data  (Peak  over  threshold  (POT) 
method). 
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It  is  worthwhile  mentioning  that  this  distribu- 
tion corresponds  to  the  wave  height  of  a storm, 
that  is,  we  assume  that  the  cdf  of  the  maximum 
wave  height  of  storms  is  Eq.  (1). 


• Confidence  interval  determination 

• Outlier  detection 

• Treatment  of  incomplete  series 


2.  Use  the  following  cdf  for  the  maximum  wave 
height  in  a period  of  duration  D years: 

Fo(a:;  A„,  5o,  iSo)""*,  (2) 

where  k is  the  mean  number  of  storms  per  year, 
or  determine  the  wave  height,  Xt,  associated 
with  a return  period  T,  that  is,  solve,  forx,  the 
equation 

Fo(x;  Ao,  5o, /3o)*  = l-|;.  (3) 

Note  that  the  cdf  in  Eq.  (2)  implies  the  assump- 
tion of  independence  of  storms. 


3.1  Some  Distribution  Families  Used  in  the 
Analysis  of  Wave  Data 


The  most  common  used  distributions  in  the  anal- 
ysis of  wave  heights  are  the  following; 

1.  The  Gumbel  family 


F(x;  A,  5)==  exp 


— 00  <x  < 00 


(7) 


2.  The  maximal  Weibull  family 


2.2  The  Yearly  Maxima  Method 
This  method  consists  of  the  following  steps: 

1.  Fit  the  yearly  maxima  to  a parametric  family  of 
distributions 

Fi(x;  Ai,  6i,  (i\) , (4) 


F(x;  A,  5,  /3)  = exp  I - } ; 

x^A  (8) 

3.  The  maximal  generalized  extreme  value  or 
Jenkinson’s  family 


where  Ai,  5i  and  /3i  are  the  new  parameters. 
This  is  equivalent  to  assuming  that  the  yearly 
maxima  follow  a distribution  which  belongs  to 
Eq.  (4). 

2.  Use  the  following  formula  to  extrapolate  to  the 
maximum  of  a period  of  D years: 

F,(x;  A,,5i,)3i)^,  (5) 


F (x ) = exp 


kA 


1 + 


kA 


^0 


4.  The  minimal  Weibull  family 


(9) 


or  determine  the  wave  height  x associated  with 
a return  period  F,  i.e.,  solve  the  equation  forx: 


F(x;  A,  5,  /3)=l-exp  {-  (^-^)  ; 


Fi(x;  Ai,  5i,  /3i)  — 1— 


(6) 


X ^ A 


(10) 


3.  Some  Problems  Related  to  the 
Data  Analysis 

In  the  analysis  of  data  one  has  to  deal  with  some 
problems.  Among  them  we  mention  the  following: 

• Selection  of  the  families  Fo(x;  Ao,  6o,  jSo)  or 
Fi(x;  A],  5i,  P\) 

• Estimation  of  the  parameters  of  the  selected 
families 


The  Gumbel,  maximal  Weibull  and  maximal 
Jenkinson’s  families  are  justified  from  a theoretical 
point  of  view,  because  they  are  the  limit  distribu- 
tions for  maxima  (see  Galambos  [5]  or  Castillo  [2]). 
It  is  interesting  to  note  that  the  Jenkinson’s  family 
includes  the  other  two,  as  particular  cases,  and  the 
maximal  Frechet  family  (for  A:>0).  The  Frechet 
distribution  is  not  justified  in  this  case  because 
wave  heights  are  physically  limited,  no  matter  we 
deal  with  shallow  or  deep  waters  (see  Castillo  and 
Sarabia  [3]  and  [4]). 
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The  minimal  Weibull  distribution,  though  widely 
used,  is  not  theoretically  justified  in  the  case  of 
maxima.  Its  only  justification  is  that  its  range  can 
be  made  to  be  consistent  with  the  positive  charac- 
ter of  wave  heights.  In  addition,  we  remind  the 
reader  that  it  belongs  to  the  maximal  domain  of 
attraction  of  the  Gumbel  type,  i.e.,  it  is  asymtoti- 
cally  equivalent  to  a Gumbel  distribution  of  the 
type  Eq.  (7). 

However,  due  to  the  fact  that  this  distribution  is 
widely  used  in  the  analysis  of  wave  heights,  it  seems 
convenient  to  make  here  some  comments. 

Initially  we  can  say  that  this  distribution  has  the 
following  advantages: 

• For  A =0,  its  range  is  (0,  «>),  that  is,  it  does  not 
include  negative  values  of  the  random  variable. 

• Assuming  that  the  location  parameter,  due  to 
physical  reasons,  is  fixed  to  zero,  it  depends 
only  on  two  parameters.  This  makes  the  estima- 
tion process  much  simpler. 

• Its  associated  domain  of  attraction  is  Gumbel 
type.  Thus,  it  could  be  used  if  this  were  the 
actual  case. 

Its  main  drawbacks  are  the  following: 

• Its  range  is  unbounded  on  the  right.  This  con- 
tradicts the  physical  reality. 

• It  does  not  cover  the  Weibull  domain  of  attrac- 
tion that  could  be  the  real  situation. 

• It  is  an  asymptotical  minimum  law. 

• It  is  not  stable  with  respect  to  maximum  opera- 
tions. Thus,  if  the  minimal  Weibull  law  is  satis- 
fied for  yearly  maxima  the  maxima  of  periods  of 
duration  different  from  one  year  cannot  satisfy 
this  law.  This  problem  can  be  solved  by  adding 
an  extra  parameter  to  this  family,  which  leads 
to  the  extended  minimal  Weibull  family. 

Consequently,  the  minimal  Weibull  family  could 
be  used  if  and  only  if  we  were  sure  that  the  domain 
of  attraction  of  wave  heights  is  of  a Gumbel  type. 

In  order  to  determine  the  domain  of  attraction 
of  a given  distribution  several  methods  are  avail- 
able, such  as  the  Pickands’  or  the  curvature 
methods  (see  Castillo  [2]  chapter  6 and  Castillo, 
Galambos  and  Sarabia  [3|). 

3.2  Estimation  Methods 

Several  methods  have  been  used  to  estimate  the 
parameters  of  the  families  Eqs.  (7)  to  (10).  The 
most  important  are: 


• The  maximum  likelihood  method 

• The  method  of  moments 

• The  least  squares  method 

• The  probability  paper  method 

• The  Goda’s  method 

» The  percentile  method 

3.2.1  The  Maximum  Likelihood  Method  This 
method  is  based  on  maximizing  the  likelihood  of 
data  with  respect  to  the  parameters.  The  central 
idea  consists  of  assuming  that  the  sample  comes 
from  a population  with  parent  distribution  belong- 
ing to  a parametric  family  and  choosing  the 
parameter  values  that  maximize  the  probability  of 
ocurrence  of  the  sample  data. 

This  is  the  best  known  method  in  statistics  and  it 
is  recognized  as  the  most  convenient,  due  to  its 
statistical  properties.  It  leads  to  the  best  estima- 
tors, which,  in  addition,  are  asymptotically  normal. 
This  allows  asymptotic  confidence  intervals  for 
the  parameters  to  be  easily  obtained.  Using  the 
5-method,  to  be  described  later,  the  confidence 
interval  of  any  regular  function  of  the  parameters 
can  be  obtained,  too.  In  particular,  confidence 
intervals  of  percentiles  can  be  obtained  in  this 
manner. 

In  order  to  estimate  an  extreme  value  distribu- 
tion with  the  purpose  of  extrapolation  beyond  the 
data  range,  only  high  order  statistics  must  be  used 
and  the  rest  must  be  discarded.  Thus,  we  recom- 
mend the  method  indicated  by  Castillo  [2],  in 
chapter  5. 

In  the  case  of  the  minimal  and  maximal  Weibull 
families,  the  estimation  process  can  lead  to  some 
problems,  either  because  the  likelihood  function 
becomes  unbounded  1)  or  because  some  non- 
regularities, for  some  values  of  the  shape  parame- 
ter (1  < /3  < 2).  However,  it  can  be  applied  to  values 
of  the  shape  parameter  larger  than  or  equal  to  2 
without  any  problem.  Thus,  once  the  estimates  are 
available,  it  is  necessary  to  check  that  their  values 
are  consistent  with  the  initial  hypothesis.  Here  we 
give  the  following  recommendations: 

• If  the  shape  parameter  takes  a negative  value, 
this  means  that  the  data  indicate  a Frechet  type 
domain  of  attraction.  This  suggests  the  pres- 
ence of  at  least  one  outlier  that  gives  an  erro- 
neous curvature  in  the  right  tail. 

• If  we  get  a value  of  /3^  1,  we  can  think  on  the 
presence  of  outliers.  This  value  of  the  shape 
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parameter  indicates  that  the  probability  density 
function  is  increasing  in  the  tail,  which  contra- 
dicts the  physical  reality. 

• If  we  get  1 < /3  < 2 then  the  law  is  far  from  the 
Gumbel  law  (note  that  Gumbel  corresponds  to 
/3=  00). 

• If  the  value  of  the  A parameter  is  less  than  the 
maximum  of  the  sample  this  indicates  that 
there  is  an  outlier. 

3.2.2  The  Method  of  Moments  This  method 
consists  of  equating  the  moments  of  the  sample  to 
the  moments  of  the  theoretical  distribution.  We 
use  as  many  moments  as  there  are  parameters  to 
be  estimated  and  we  get  the  same  number  of  equa- 
tions from  which  the  parameters  can  be  obtained. 
The  asymptotic  properties  of  the  moment  estimates 
are  good  but  worse  than  those  associated  with  the 
maximum  likelihood  estimates. 

This  method  can  also  be  applied  to  tail  estima- 
tion, using  the  moments  of  the  truncated  distribu- 
tion. 


tribution  is  by  means  of  the  percentile  method. 
This  method  consists  of  equating  as  many  percen- 
tiles in  the  sample  and  the  theoretical  distribution 
as  the  number  of  parameters  to  be  estimated. 

As  an  illustrative  example  we  use  this  method  for 
the  estimation  of  the  parameters  of  a three  parame- 
ter maximal  Weibull  family. 

The  cdf  of  the  maximal  Weibull  distribution  is: 

G(x)  = exp[-(^-^)^j . (II) 


Thus,  the  percentile  or  order p satisfies  the  equa- 
tion 


p =exp 


from  which  we  get 


x,.  = \-8(-\ogpy'^ . 


(12) 


(13) 


3.2.3  The  Least  Squares  Method  This  method 
consists  of  minimizing  the  sum  of  squares  of  the 
differences  between  the  theoretical  and  the  empiri- 
cal values.  There  are  many  versions  of  this  method. 
In  some  cases  the  random  variable  scale  is  used  to 
measure  the  errors  and  in  other  cases  the  probabil- 
ity or  the  return  period  scales  are  used  (see  chapter 
4 of  Castillo  [2]). 

The  main  advantage  of  these  methods  is  that 
they  give  an  explicit  solution  and  do  not  depend  on 
convergence  of  any  algorithm,  as  is  the  case  with 
the  maximum  likelihood  method. 

Nevertheless,  these  methods  are  sensitive  to  the 
plotting  position  formulas  used  in  the  estimation 
method. 

3.2.4  The  Probability  Paper  Method  By  prob- 
ability paper  method  we  understand  a visual 
method,  in  which  the  data  is  drawn  on  probability 
paper  and  a straight  line  is  visually  fitted  to  data. 

The  main  drawback  of  this  method  is  that  it 
depends  on  the  plotting  position  formula  used  in 
the  graphic  representation  and  the  subjective 
criteria  for  selecting  the  optimal  fit. 

3.2.5  The  Goda  Method  Goda  [4]  fits  a mini- 
mal Weibull  distribution,  truncated  at  the 
threshold  value  xo,  to  the  right  tail  of  data.  By  right 
tail  are  meant  the  wave  heights  above  a second 
threshold  value  Xi  > >Xo. 

3.2.6  The  Percentile  Method  One  way  of  ob- 
taining quick  estimates  of  the  parameters  of  a dis- 


Equating the  three  percentiles  of  orderspi,/?2,p3 
of  sample  and  population,  we  get  the  following  sys- 
tem of  equations: 

X/,,  = A - 5 ( - log/?, ) ‘^^  ; / = 1,  2,  3 , (14) 


where  /?,  can  be  written,  using  the  Gringorten’s 
formula,  as: 


i - 0.44 
" n -K0.12’ 


(15) 


where  i is  the  rank  of  the  order  statistic  associated 
with  Pi. 

From  Eq.  (14)  we  get 


X,>2-X,n  ^ (-l0g/?2)  '^^-(-logpi) 

X,n-X,.2  (-log/?3)  ‘'^-(-l0gp2) 


which  depends  only  on  the  parameter  /3  and  thus,  it 
can  be  easily  solved  by  an  iterative  method,  as  the 
bisection  method  for  example,  with  a personal 
computer.  Once  (3  is  known,  the  values  of  A and  8 
can  be  obtained  from  any  two  of  the  equations  in 
Eq.  (14).  For  example: 

„ _ ^l’2  ~^l>l ^ 

(-log/?,)  '''’-(-logp2) 

A = X,,, +5(-log/?i) (17) 
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For  the  estimates  to  be  consistent  with  the 
model  we  must  have 


A >max(xi,X2, . . . ) (18) 


where  (x\,X2, . . . ,x„)  is  the  sample. 

If  the  percentiles  are  arbitrarily  chosen,  this 
inconsistency  can  easily  appear.  Thus,  it  is  good 
practice  to  choose  as  one  of  the  percentiles  the 
maximum  of  the  sample  X(„). 

In  addition,  if  we  are  dealing  with  a tail  estima- 
tion we  must  choose  the  adequate  percentiles,  that 
is,  percentiles  in  it. 

In  order  to  improve  the  quality  of  the  estimates 
we  can  use  three  groups  of  percentiles  instead 
of  three  percentiles,  that  is,  replace  the  system 
Eq.  (14)  by  the  system 


— 2 % 


A 


nij 


1 


i=kj 


(- log 


j = 1,2,3 


(19) 


3.2.7  Plotting  Position  Formulas  There  is 
much  controversy  about  the  plotting  position 
formulas  to  be  used  for  representing  data  on  prob- 
ability paper  and  the  posterior  estimation  by  least 
squares  methods. 

The  resulting  estimates  are  sensitive  to  the  plot- 
ting position  formulas  being  used.  This  confirms 
the  fact  that  the  least  squares  method  is  not  opti- 
mal. Note  that  maximum  likelihood  or  moment 
methods  do  not  depend  on  plotting  positions. 

The  discussion  of  the  appropriateness  of  various 
formulas  is  intended  to  avoid  or  reduce  some  of  the 
errors  involved  (in  this  case  authors  recommend 
using  formulas  leading  to  unbiased  estimators). 

However,  we  mention  here  that  all  plotting  posi- 
tion formulas  are  asymptotically  equivalent. 


3.3  5-Method 

The  5-method  (Bishop,  Fienberg,  and  Holland 
[1])  allows  us  to  obtain  confidence  intervals  of 
certain  regular  functions  of  the  parameters,  as 
functions  of  the  parameter  estimates,  and  its  vari- 
ance-covariance matrix. 

Let 


where  nij,  (j  = 1,  2,  3)  are  the  numbers  of  percen- 
tiles included  in  each  group.  With  this,  equation 

Eq.  (16)  becomes  Eq.  (20). 


J_ 

m2 


2 


l=*2 


Xp. 


J_ 

mi 


I =ifei  - 1 

i = ki 


m3 


i=k} 


Xp, 


X 

m2 


S Xp, 

i = k2 


7],  = /?,(Ai,  A2, . . . , A,);  / = 1,2, ...  ,k  (21) 

be  k functions  of  the  set  of  parameters 
Ai,  A2, . . . , Aj.  Then,  according  to  the  5-method, 


iVi,  rii,.  . .,  77i)  = (/ii(Ai,  A2, . . . , A,), 


/j2(A|,  Xj,  • ■ ■ , K),  . . . , hk  (Ai,  A2, . . . , A, ))  (22) 


/ — A:2  + m2  “ 1 


i-ki  +/7II  - 1 


“ S (-log/j,)*'^-;^  E (-logp,) 


i = k2 


i — k2  + m2~l 


(20) 


^ E (-iogpiyp--^  S (-logp,) 


i = k-t 


is  an  estimator  of  (771,  pi, . . . ,17*)  which  is  asymp- 
totically normal  and  has  mean 


(/i i(Ai,  A2, . . . , A, ),  hi{X\,  A2, . . . , Aj ), . . . , 

hk{X,,  Xi, . . . , X,))  (23) 
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and  variance-covariance  matrix 


dh\ 

dhi 

dhk 

dki 

dk\ 

'■  ^Ai 

dh\ 

dhi 

dhk 

dki 

dki 

dki 

dhi 

dhi 

dhk 

dks 

dks  ■ 

dks 

dhx 

dhi 

dhk 

dk\ 

dki  ■ ■ 

(7Ai 

dhi 

dhi 

dhk 

(7  A 2 

dki 

dki 

dh\ 

dhi 

dhk 

dks 

dks 

" dks 

where  X is^  the  variance-covariance  matrix  of 
(Ai,  A2, . . . , Ai ). 

3.4  Estimation  of  Percentiles  of  the  Maximal 
Weibull  Distribution 

As  a simple  example  of  the  5-method  we  give  the 
confidence  interval  of  one  percentile  of  the  three 
parameter  maximal  Weibull  distribution.  We  as- 
sume that  the  parameter  (3  is  larger  than  2. 

3.4.1  Point  Estimate  The  percentile  x,,  of  the 
maximal  Weibull  distribution  is: 


The  maximal  Weibull  distribution  satisfies  the 
necessary  regularity  conditions  for  the  asymptotic 
normality  if  (3^2.  Thus,  if  we  have  a sufficiently 
large  sample  coming  from  a population  with  maxi- 
mal Weibull  parent,  we  can  write: 

y/ni(\,  5,  P)-iX,8,f3))^N{Q,  X)  (28) 

where 

X^il+jy',  (29) 

where  I = {a,j)  is  the  information  matrix  associ- 

ated with  the  maximal  Weibull  family,  that  is, 

= = (30) 

<1,=  = /..= -|ir(2-i)  (31) 


an  = /«=-jr(i-i)+tr(2-i)+ir(2-i) 

(32) 

(I22  = /ii  = ^ (33) 

OB  = /«J  = -t/-(2)  (34) 

»b  = /m  = jl(l+n2))  (35) 


Xp  = \-8{-\ogpyy  (25) 

Thus,  according  to  the  invariance  principle,  the 
maximum  likelihood  estimator  of  that  percentile, 

= A-5(-logp)‘^^‘ 

3.4.2  Maximum  Likelihood  Estimators:  Asymp- 
totic Theory  We  assume  here  that  the  sample 
consists  of  those  observed  values  above  the 
threshold  value  t (type  II  censoring),  that  is,  the 
probability  density  function  is  given  by 


Mx)  = 


fix-, 8,  (3) 

1-F(r;  A,6,/3) 


(27) 


and  / and  F are  the  pdf  and  cdf  of  the  maximal 
Weibull  family. 


and  J is  the  matrix  of  the  second  order  partial 
derivatives,  with  respect  to  the  parameters  of  the 
model,  of  the  function 

G(t;  A,5,/3)  = log[l-E(r;  A,5,/3)]. 

If  we  consider  now  the  function: 


fi\,8,  f3)  = Xp  = k-8{-\ogpy'y  (36) 


with  partial  derivatives: 


/.  = 

f2  = 


11 

dk 

11 


d8 


- 1 


= -{-xogpy^ 


(37) 


/3  = = ^l0g((-l0g/7))(-l0g/3)'^'' 
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then,  by  the  6-method  we  have: 

5,  /3)  -/(A,  8,(5))^  N{Q,  X*)  (38) 

where 

X*  = (fl,f2,f3)X(fuf2,hy  , (39) 

from  which  the  confidence  interval  for  the  percen- 
tile Xp  at  level  a becomes: 

^*i/2  ^ ^*1/2 

{Xp—Zl-al2  ~',Xp+Zl-al2  (40) 

yn  \fn 


3.5  Outlier  Detection 

In  this  section  we  give  a method  to  detect  the 
presence  of  outliers  in  the  sample  data.  The  method 
is  based  on  the  fact  that  if  we  make  the  following 
change  of  variable: 

y = F{X)  (41) 


3.6  Treatment  of  Incomplete  Series 

If  we  know  about  the  existence  of  r storms  in  a 
given  series,  but  we  ignore  the  peak  intensities  we 
can  perform  an  estimate  based  on  the  known  peaks 
and  then  make  a correction  for  the  unknown 
peaks.  This  means  estimating  the  cdf  with  the 
known  peaks  and  raise  to  the  power  {n  +r)ln, 
where  n and  r are  the  number  of  known  and  un- 
known peaks,  respectively. 

4.  Critical  Analysis 

In  this  section  we  analyze  the  previous  methods 
and  discuss  some  of  their  inconsistencies. 

4.1  Inconsistencies  due  to  the  Lack  of  Stability 
With  Respect  to  Maximum  Operations 

When  several  design  methods  are  recognized  by 
the  engineering  community  a certain  consistency  in 
the  respective  results  should  be  expected.  We  shall 
see  that  this  is  not  the  case  for  some  of  the  previous 
methods. 

Let  us  assume  that  we  try  to  fit  the  minimal 
Weibull  family 


where  F{x)  is  the  cdf  of  X,  the  resulting  random 
variable,  Y,  is  uniform  17(0,1). 

In  addition  we  know  that  the  maximum  of  a 
random  sample  of  size  n coming  from  a standard 
uniform  parent  has  cdf 


F{x-,  Ao,  5o,  AO  ==  1 - exp 


(46) 


where  Au,  5o  and  ySo  are  the  parameters.  Then,  Eq. 
(2)  transforms  to 


Fy^^iy)=y''  = Prob[y^ax^y].  (42) 

We  shall  say  that  the  sample  maximum  is  one 
outlier  if  the  probability  of  being  exceeded  is  very 
small.  Thus,  the  value  yo  can  be  considered  as  criti- 
cal for  the  maximum  value  of  the  sample  if 


Fd{x',  Ao,  5o,  jSo)  — F(x',  Ao,  5(i,  /3())™  — 


1 1 - exp 


X — A(i 
5o 


/3o 


Dk 


(47) 


Probfy^ax  >yo]  = Fy^^^iyo)  = 1 -y'd  = a (43) 


and  the  wave  height  associated  with  a return  period 
T becomes  Eq.  (3): 


with  a very  small  (0.01,  0.05,  etc.).  Then,  we  get 
yo  = (!-«)""  (44) 


(48) 


This  critical  value  refers  to  the  random  variable 
y.  Thus,  we  need  to  obtain  X by  means  of  the  in- 
verse of  Eq.  (41).  As  one  example,  for  the  maximal 
Weibull  distribution  we  get 


xo  - A-5[-log(l-a)’'"]‘'^  (45) 


Let  us  assume  that  now  we  also  use  the  minimal 
Weibull  family  in  Eq.  (4): 


E,(x;  A,,3.,i3i)  = 1 - exp  { - , (49) 
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where  Ai,5i,j8i  are  the  new  parameters.  Then, 
Eq.  (5)  becomes 


Fd{x\  Ai,  5i 


exp 


(56) 


Fd{x\  Ai,  6i,  /3i)  = Fi{x\  Ai,  5i,  = 


and 


(50) 


Xr  = Ai  - 5i  log 


(57) 


and  the  wave  height  associated  with  a return 
period  T,  from  Eq.  (6),  is 


and,  taking  into  account  that 


Xt  — Ai  + 5i 


{-log[l-(l-i)]}“  (51) 


The  minimal  Weibull  model  is  inconsistent  in  the 
following  sense:  It  is  not  stable  with  respect  to  max- 
imum operations,  that  is,  when  the  cdf  is  raised  to 
a given  power  sp^l,  then,  the  resulting  cdf  is  not 
minimal  Weibull.  Thus,  though  Eq.  (46)  is  minimal 
Weibull,  Eq.  (47)  is  not  minimal  Weibull  for  Dk  9^1. 
In  other  words,  if  we  assume  a minimal  Weibull 
distribution  for  the  peaks  of  storms,  the  yearly 
maxima  cannot  be  minimal  Weibull  and  vice  versa. 
In  fact  for  equations  Eqs.  (47)  and  (48)  to  be 
identical  to  Eqs.  (50)  and  (51),  respectively,  i.e.,  for 
consistency,  we  must  have 

Ao  = Ai,  5o  = 5i,  /3o  = /3i,  k — I (52) 


the  coincidence  of  the  pairs  Eqs.  (54)-(56)  and  Eqs. 
(55)-(57)  implies 

So  = 5i  y Ao  = Ai  — 5]  log  k . (59) 


That  is,  the  coincidence  of  both  is  possible  for  any 
value  of  k. 

The  same  conclusion  is  valid  for  any  of  the 
Weibull  Eq.  (8)  or  the  Jenkinson’s  Eq.  (9)  families. 


which  implies  ^ = 1,  that  is  a mean  number  of  one 
storm  per  year,  which  is  not  the  case. 

However,  if,  instead  of  using  the  minimal  Weibull 
family  we  use  the  maximal  Gumbel  family 


Fo{x',  Ao,  5o) 


(53) 


then,  Eqs.  (47),  (48),  (50)  and  (51)  become 


Fd(x',  Ao,  5o 


exp 


(54) 


Xt  = ko-  So  log 


(55) 


4.2  Inconsistencies  Associated  With  the  Lack  of 
Stability  With  Respect  to  Truncation 

Goda’s  method  is  inconsistent  for  the  following 
reasons: 

1.  It  gives  different  estimators  for  different  values 
of  Xo. 

2.  If  the  truncated  distribution  belongs  to  the 
minimal  Weibull  family  it  cannot  belong  for  a 
different  threshold  value.  Thus,  different 
designers  using  different  threshold  values 
necessarily  arrive  to  different  models. 

In  the  following  paragraphs  we  shall  make  a de- 
tailed analysis  of  this  problem. 

With  respect  to  the  first  inconsistency  it  is  clear 
that  because  the  method  only  uses  the  data  above 
the  second  threshold  value  Xi,  the  resulting  esti- 
mates should  be  independent  on  the  first  threshold 
value  Xo. 
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In  relation  to  the  second  inconsistency,  the 
model  should  be  stable  with  respect  to  truncations. 
With  the  purpose  of  clarifying  this  idea,  let  us 
assume  that  we  choose  a family  of  candidate  distri- 
butions where  the  second  argument  y is 

one  parameter,  which,  without  loss  of  generality, 
can  be  assumed  to  be  the  threshold  value.  Then,  if 
the  wave  height  exceeding  z has  as  cdf  the  function 
H{x,z),  then,  the  wave  height  exceeding  y should 
have  a cdf  given  by 


parent  distribution  in  the  domain  of  attraction  for 
maxima  of  a Gumbel  type. 

Consequently,  as  a summary,  we  recommend  to 
fit  the  sample  data  above  the  threshold  to  one  of 
the  following  three  families: 

• If  the  domain  of  attraction  is  Weibull  type,  fit 
the  right  tail  to  the  maximal  Weibull  family 

F^{x-,  A„,  5o,  A))  = exp  { - ) ^"}  (65) 


H{x;z)-H(y;z) 


H{x;y), 


(60) 


where  the  right  hand  term  arises  from  the  consis- 
tency condition  that  expresses  that  the  family 
H{x;z)  remains  valid  for  any  value  of  the  threshold 
parameter,  w'hich  in  this  case  is  y . 

Equation  (60)  is  a functional  equation.  Its  gen- 
eral solution  can  easily  be  obtained  by  making 
z - Zo,  that  is, 


H(x;y) 


Gix)-G(y) 

1-G(y) 


where 


G(x)  = H{x;zo) . 


(61) 


(62) 


For //(x;y)  to  be  a cdf,  then  G(x)  must  also  be  a 
cdf. 

Equation  (61)  proves  that  any  consistent  family 
H{xiy)  must  come  from  another  family  G(x)  by 
means  of  a truncation  procedure. 

The  minimal  Weibull  family,  used  by  Goda,  does 
not  satisfy  this  condition.  Thus,  it  is  inconsistent. 

W^ith  the  purpose  of  having  a consistent  family  in 
the  two  previously  given  senses,  one  solution  would 
consist  of  assuming  G(x)  to  be  extended  minimal 
Weibull  with  null  location  parameter.  This  would 
imply  that  the  sample  data  above  the  threshold 
value  Xi  should  be  fitted  to  the  family 


H(x;y,  8,  p) 


lF(x;8,mT-\F(y;8,PW 

l-lF(y;S,p)Y 


, (63) 


where 


F(x;  3,^)  = 1-exp  [-(l)  j , (64) 


Nevertheless,  we  remind  the  reader  that  this 
solution  can  be  satisfactory  only  in  the  case  of  a 


• if  the  maximal  domain  of  attraction  is  Gumbel 
type,  fit  the  right  tail  to  the  maximal  Gumbel 
family 


Fo(x;  Ao,  5(1,)  = exp 


(66) 


or  to  the  extended  minimal  Weibull  family 

Fo(x;A,5,^)  = jl-exp[^--(^)  j|  ; 

x^O  (67) 

where  ft,  8 and  rj  are  the  parameters  to  be  esti- 
mated. In  the  last  case  we  are  assuming  that 
the  cdf  of  the  maximum  wave  height  in  an 
indeterminate  period,  to  be  estimated,  is  mini- 
mal Weibull. 

Note  that  fitting  the  right  tail  means  fitting  a 
truncated  model  with  basic  distribution  given  by 
Eqs.  (65),  (66)  or  (67). 

All  these  models  are  consistent  in  the  previously 
mentioned  sense. 

43  Inconsistencies  Associated  With  the  Use  of 
Hs  and  Tz 

It  is  very  common  in  the  Ocean  Engineering  field 
to  work  with  the  significant  wave  height,  Hs,  and 
period,  Tz,  as  the  basic  variables  for  extreme  value 
analysis  of  waves.  However  this  is  not  correct 
because  is  the  mean  zero  up-crossing  period  and 
Hs  is  defined  as  the  mean  of  the  1/3  largest  waves. 
These  two  random  variables  are  convenient  to  jus- 
tify normality  assumptions  in  wave  spectra,  but  can- 
not be  accepted  if  an  extreme  value  analysis  of  wave 
height,  H,  is  to  be  performed.  In  fact,  distributions 
in  different  domain  of  attraction  types  can  lead  to 
the  same  distribution  for  and/or  T , thus,  obscur- 
ing the  tail  properties  of  single  waves. 
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5.  Conclusions 

From  all  the  above  we  get  the  following  conclu- 
sions: 

1.  The  most  convenient  families  to  fit  wave  height 
data  in  the  tails  are: 

• The  maximal  Weibull  family 

• The  maximal  Gumbel  family 

However,  the  extended  minimal  Weibull  family 
can  be  used  too. 

Before  fitting  the  Gumbel  or  the  extended 
minimal  Weibull  families,  the  domain  of  attrac- 
tion for  maxima  must  be  checked  using,  for 
example,  the  Pickands  or  the  curvature  meth- 
ods. For  the  estimation  of  the  parameters, 
the  maximum  likelihood  or  the  method  of 
moments  applied  to  the  truncated  distributions 
is  recommended. 

In  the  case  of  the  maximal  Weibull  family,  the 
shape  parameter  (3  must  be  larger  than  unity.  If 
it  is  not,  the  data  suggests  an  increasing  proba- 
bility density  function  in  the  tail,  which  contra- 
dicts'the  reality. 

2.  It  is  recommended  the  elimination  of  outliers 
by  means  of  the  following  iterative  method: 

(a)  Estimate  all  parameters  with  all  data  but 
the  maximum 

(b)  Check  for  the  outlier  character  of  the 
maximum  by  the  previously  indicate 
method 

(c)  If  it  is  an  outlier,  remove  the  maximum 
and  start  the  process  again;  if  it  is  not, 
repeat  the  estimation  with  all  the  valid 
data 

3.  If  there  are  missing  data  correct  the  obtained 
cdf  by  raising  to  the  power  {n  +r)/n  where  n 
and  r are  the  number  of  known  and  unknown 
data,  respectively. 

4.  Significant  wave  height  //*  and  mean  up-cross- 
ing periods  are  not  adequate  variables  to 
analyze  the  extreme  value  behaviour  of  wave 
heights. 
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